Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Generating Ground Truthed Dataset of Chart Images: Automatic or Semi-automatic?

Identifieur interne : 000C35 ( Main/Exploration ); précédent : 000C34; suivant : 000C36

Generating Ground Truthed Dataset of Chart Images: Automatic or Semi-automatic?

Auteurs : Weihua Huang [Singapour] ; Lim Tan [Singapour] ; Jiuzhou Zhao [Singapour]

Source :

RBID : ISTEX:99E7AF6A47BC65ECF49F1937BDE7FF0FB1F3D0AB

Abstract

Abstract: Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating a chart image dataset and multi-level ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dataset as well as the ground truth data are publicly available so that other researchers can access them for evaluating and comparing performances of different systems.

Url:
DOI: 10.1007/978-3-540-88188-9_25


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Generating Ground Truthed Dataset of Chart Images: Automatic or Semi-automatic?</title>
<author>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
</author>
<author>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
</author>
<author>
<name sortKey="Zhao, Jiuzhou" sort="Zhao, Jiuzhou" uniqKey="Zhao J" first="Jiuzhou" last="Zhao">Jiuzhou Zhao</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:99E7AF6A47BC65ECF49F1937BDE7FF0FB1F3D0AB</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/978-3-540-88188-9_25</idno>
<idno type="url">https://api.istex.fr/document/99E7AF6A47BC65ECF49F1937BDE7FF0FB1F3D0AB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000899</idno>
<idno type="wicri:Area/Istex/Curation">000889</idno>
<idno type="wicri:Area/Istex/Checkpoint">000702</idno>
<idno type="wicri:doubleKey">0302-9743:2008:Huang W:generating:ground:truthed</idno>
<idno type="wicri:Area/Main/Merge">000C47</idno>
<idno type="wicri:Area/Main/Curation">000C35</idno>
<idno type="wicri:Area/Main/Exploration">000C35</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Generating Ground Truthed Dataset of Chart Images: Automatic or Semi-automatic?</title>
<author>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive 2, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive 2, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Zhao, Jiuzhou" sort="Zhao, Jiuzhou" uniqKey="Zhao J" first="Jiuzhou" last="Zhao">Jiuzhou Zhao</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive 2, 117543</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Singapour</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2008</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">99E7AF6A47BC65ECF49F1937BDE7FF0FB1F3D0AB</idno>
<idno type="DOI">10.1007/978-3-540-88188-9_25</idno>
<idno type="ChapterID">25</idno>
<idno type="ChapterID">Chap25</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Ground truthing tools mainly fall into two categories: automatic and semi-automatic. In this paper, we first discuss the pros and cons of the two approaches. We then report our own work on designing and implementing systems for generating a chart image dataset and multi-level ground truth data. Both semi-automatic and automatic approaches were adopted, resulting in two independent systems. The dataset as well as the ground truth data are publicly available so that other researchers can access them for evaluating and comparing performances of different systems.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Singapour</li>
</country>
<orgName>
<li>Université nationale de Singapour</li>
</orgName>
</list>
<tree>
<country name="Singapour">
<noRegion>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
</noRegion>
<name sortKey="Huang, Weihua" sort="Huang, Weihua" uniqKey="Huang W" first="Weihua" last="Huang">Weihua Huang</name>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<name sortKey="Tan, Lim" sort="Tan, Lim" uniqKey="Tan L" first="Lim" last="Tan">Lim Tan</name>
<name sortKey="Zhao, Jiuzhou" sort="Zhao, Jiuzhou" uniqKey="Zhao J" first="Jiuzhou" last="Zhao">Jiuzhou Zhao</name>
<name sortKey="Zhao, Jiuzhou" sort="Zhao, Jiuzhou" uniqKey="Zhao J" first="Jiuzhou" last="Zhao">Jiuzhou Zhao</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000C35 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000C35 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:99E7AF6A47BC65ECF49F1937BDE7FF0FB1F3D0AB
   |texte=   Generating Ground Truthed Dataset of Chart Images: Automatic or Semi-automatic?
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024